NSF PAR Search | NSF Public Access Repository

Improve Temporal Awareness of LLMs for Sequential Recommendation

Chu, Zhendong; Wang, Zichao; Zhang, Ruiyi; Ji, Yangfeng; Wang, Hongning; Sun, Tong (June 2024, 1st ICML Workshop on In-Context Learning at ICML 2024)

Full Text Available
Retrieval-based Controllable Molecule Generation

Wang, Zichao; Nie, Weili; Qiao, Zhuoran; Xiao, Chaowei; Baraniuk, Richard; Anandkumar, Anima (May 2023, International Conference on Learning Representations (ICLR) 2023)

Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training/fine-tuning with a large dataset, often unavailable in real-world generation tasks. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a small set of exemplar molecules, i.e., those that (partially) satisfy the design criteria, to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. We design a retrieval mechanism that retrieves and fuses the exemplar molecules with the input molecule, which is trained by a new self-supervised objective that predicts the nearest neighbor of the input molecule. We also propose an iterative refinement process to dynamically update the generated molecules and retrieval database for better generalization. Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning. On various tasks ranging from simple design criteria to a challenging real-world scenario for designing lead compounds that bind to the SARS-CoV-2 main protease, we demonstrate our approach extrapolates well beyond the retrieval database, and achieves better performance and wider applicability than previous methods.
Full Text Available
Improving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rank

https://doi.org/10.18653/v1/2023.bea-1.22

Ashok Kumar, Nischal; Fernandez, Nigel; Wang, Zichao; Lan, Andrew (January 2023, Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023))

Full Text Available
Open-ended Knowledge Tracing for Computer Science Education

Liu, Naiming; Wang, Zichao; Baraniuk, Richard; Lan, Andrew (December 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Interpretable Math Word Problem Solution Generation via Step-by-step Planning

https://doi.org/10.18653/v1/2023.acl-long.379

Zhang, Mengxue; Wang, Zichao; Yang, Zhichao; Feng, Weiqi; Lan, Andrew (January 2023, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
DeepHull: Fast Convex Hull Approximation in High Dimensions

https://doi.org/10.1109/ICASSP43922.2022.9746031

Balestriero, Randall; Wang, Zichao; Baraniuk, Richard G. (May 2022, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Computing or approximating the convex hull of a dataset plays a role in a wide range of applications, including economics, statistics, and physics, to name just a few. However, convex hull computation and approximation become exponentially more expensive, in both memory and computation, as the ambient space dimension increases. In this paper, we propose DeepHull, a new convex hull approximation algorithm based on convex deep networks (DNs) with continuous piecewise-affine nonlinearities and nonnegative weights. The idea is that binary classification between true data samples and adversarially generated samples with such a DN naturally induces a polytope decision boundary that approximates the true data convex hull. A range of exploratory experiments demonstrates that DeepHull efficiently produces a meaningful convex hull approximation, even in a high-dimensional ambient space.
Full Text Available
Automated Scoring for Reading Comprehension via In-context BERT Tuning

https://doi.org/10.1007/978-3-031-11644-5_69

Fernandez, Nigel; Ghosh, Aritra; Liu, Naiming; Wang, Zichao; Choffin, Benoit; Baraniuk, Richard G.; Lan, Andrew S. (July 2022, International Conference on Artificial Intelligence in Education)

Full Text Available
Open-ended Knowledge Tracing for Computer Science Education

https://doi.org/10.18653/v1/2022.emnlp-main.254

Liu, Naiming; Wang, Zichao; Baraniuk, Richard; Lan, Andrew (January 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Scientific Formula Retrieval via Tree Embeddings

https://doi.org/10.1109/BigData52589.2021.9671942

Wang, Zichao; Zhang, Mengxue; Baraniuk, Richard G.; Lan, Andrew S. (December 2021, 2021 IEEE International Conference on Big Data (Big Data))

Full Text Available
The Recurrent Neural Tangent Kernel

Alemohammad, Sina; Wang, Zichao; Balestriero, Randall; Baraniuk, Richard (May 2021, The International Conference on Learning Representations)

The study of deep neural networks (DNNs) in the infinite-width limit, via the so-called neural tangent kernel (NTK) approach, has provided new insights into the dynamics of learning, generalization, and the impact of initialization. One key DNN architecture remains to be kernelized, namely, the recurrent neural network (RNN). In this paper we introduce and study the Recurrent Neural Tangent Kernel (RNTK), which provides new insights into the behavior of overparametrized RNNs. A key property of the RNTK that should greatly benefit practitioners is its ability to compare inputs of different lengths. To this end, we characterize how the RNTK weights different time steps to form its output under different initialization parameters and nonlinearity choices. A synthetic experiment and 56 real-world data experiments demonstrate that the RNTK offers significant performance gains over other kernels, including standard NTKs, across a wide array of data sets.
Full Text Available
